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Abstract 

Many organisms live in populations structured by class and location, exhibit plastic responses 
to their environment and their social partners, and are subject to non-additive ecological and 
fitness effects. Social evolution theory has long recognized that all of these factors can lead 
to different selection pressures but has only recently has attempted to synthesize how these 
factors interact. Using models for both discrete and continuous phenotypes, we show that 
analyzing these factors into a consistent framework reveals that they generically interact with 
one another in ways not previously emphasized or recognized. Specifically, behavioral responses, 
genetic assortment, and synergy all interact in non-trivial ways that cannot be summarized by 
simple summary indices. In contrast, some recent attempts at synthesis yield flawed conclusions 
because they neglect these interactions, and thus lead to an incomplete picture of how natural 
selection shapes social phenotypes. We suggest that these interactions should be a boon for 
empirical research as they are likely crucial for describing the complexity of biological systems. 
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1. Introduction 

Explaining the evolution of social behaviors has been a goal of evolutionary theory going back 



behaviors both cooperative 
, which showed that genetic 



to Darwin's time. The modern effort to explain the origin of socia 
and noncooperative started with the seminal work of Hamilton [l[ [2] 
relatedness has a profound influence on evolutionary dynamics. Later work added other significant 
factors, starting with conditional and responsive behaviors 0, 0] and extending to non-additive 
a—a—fl. Be ginning in the 1980's, researchers analyzed many models that apply and 
extend these mechanisms [e.g., extending reciprocity to include indirect responses in ref|6|]. At the 
same time, empirical researchers tested the role of various mechanisms developed in the theoretical 
literature [e.g., LZHSf] . 

Whereas early discussions of social evolution usually pitted one mechanism against another, 
recent efforts are increasingly focused on synthesizing and integrating the various pathways of 



social evolution [e.g., 



1CH15||. This is perhaps the surest sign that social evolution theory has 



entered a mature phase. These efforts at synthesis are not always without controversy, but they 
have served to illuminate the connections between various models devised over the decades. In this 
paper, we aim to contribute to this synthetic effort by focusing specifically on the relation between 
three important concepts: assortment of genotypes (relatedness), behavioral responsiveness (e.g., 
reciprocity), and non-additive (e.g., synergistic) effects of social behaviors on fitness. These topics 
have of course been written about extensively in the past; yet, how these concepts relate to one 
another is still not well understood. 

Two of the most fundamental mechanisms for the evolution of cooperation are responsive (and 
conditional) behaviors such as direct and indirect reciprocity, and genetic assortment that creates 



relatedness between individuals 111 ]. Both factors are clearly in operation in many, if not most 
social interactions. Cooperative breeding, for example, almost always involves interactions with 
relatives of varying degree. At the same time, most cooperatively breeding species also exhibit 



conditional behavior; for example, non-helpers might get evicted from the breeding group 



id- 



Likewise, reciprocal food-sharing among vampire bats can occur amongst relatives as well as non- 
relatives 

Non-additive interactions between individual phenotypes also play a prominent role in social 
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evolution. Sometimes, the non-additivity is a direct function of organismal metabolism, as in the 
case of the plant pathogen Agrobacterium tumefaciens, which induces infected hosts to produce 
opines, which can be used as a resource only by those cells that also carry the virulence plasmid 



181 ] . In other cases, the ecological context might create non-additive payoffs. For example, in 



the penduline tits (Remiz pendulous), males contribute to the building of an elaborate nest, while 
females lay eggs inside the nest If]]. After eggs are laid, one or both of the parents desert the 
nest, which is then tended either by a single parent or by neither parent. Data on breeding success 
suggests that the reason for this behavior is in part that a single parent has better than even 



chance of raising the clutch 



201 ] . Hence, the second parent adds less to breeding success than the 



first parent, meaning a negative non-additivity. It has been argued that the opposite is true for 
many bird species with biparental care [2l(]. It is worth noting that in both of these examples, 
as perhaps with most instances of non-additive interactions between phenotypes, the traits in 
question are plastic. For example, the Agrobacterium pathways inducing opine production are 
not expressed constitutively and instead depend on quorum-sensing [18]. Likewise, the male and 
female decisions to desert in penduline tits appear context-dependent; the same individual can care 
or desert according to factors such as brood size 



19, 



22]- 



Yet, despite enormous theoretical work on conditional behaviors, genetic assortment, and non- 
additive payoffs, the interplay between these three factors has attracted relatively little attention 
^ recent* fl Q QQ. So.e of the recent wo rk sy ntne sizi n g these different factors mo he S 
strong claims about the relationships between them. For example, Fletcher & Doebeli [141 ] asserts 
that the fundamental condition favoring cooperation is assortment between cooperative genotypes 
and cooperatively behaving partners, regardless of whether the assortment is caused by conditional 
jehaviors or population structure. Other authors, notably Queller jj], and Fletcher & Zwick 
23] ]. claimed that reciprocity is inherently a multiplicative or non-additive interaction between 
phenotypes. However, as we show below, both of these claims rely on models of limited generality 
and do not hold in general. Our aim is therefore to sort out the exact relationships between 
behavioral responses, genetic assortment, and non-additivity. 

Our paper beings with two models: first, we introduce a discrete-action model to clarify the 
relationship between genetic assortment, behavioral responses, and synergistic or non-additive in- 
teractions; second, we generalize the results of the discrete-action model by presenting a population 



genetic model for the evolution of a continuous trait in a structured population, where the struc- 
ture can be due to any combination of age, stage, sex, or spatial location. Using these models, 
we show that the claims that reciprocity and genetic assortment are simply equivalent mechanisms 
generating phenotypic assortment [l4] and that their effects contribute additively to the response 
to selection 15[ are generally incorrect. Instead, these factors have independent effects that can 
interact in non-linear ways. We also show that synergistic interactions, while important, are not 
necessary for capturing the effect behavioral responses and reciprocity. Rather, synergistic inter- 
actions can affect the stability of social behaviors or even change the payoff structure of the game 
entirely. 

2. Discrete action model 

2.1. Reciprocity and genetic assortment 

Our model beings with a 2x2 game that is played repeatedly between two individuals a large 
number of times. We take the game to be symmetric for ease of exposition, but this assumption 
can be relaxed with no consequence for our main argument. Label the two actions C and D. Most 
models of such repeated games focus on particular conditional strategies (e.g., tit-for-tat [4j] and its 
numerous relatives). We would like a behavioral model that admits a wider range of propensities to 
respond to one's opponent. Hence, we assume that each individual % has two traits: a^, which is the 
action (C or D) that the individual i intrinsically "prefers", and pi, the probability that individual 
i overrides its intrinsic preference and reciprocates its opponent's action in the last round. Hence, 
in common parlance, dj corresponds to whether individuals are cooperators or defectors, and pi 
denotes the degree to which individuals are reciprocators. Both traits are genetically encoded; we 
take cij to be binary and pi to be a continuous trait between and 1. As an aside, a straightforward 
extension of this model could treat as a continuous trait by interpreting it as the intrinsic 
probability of playing C vs. D. 

Consider an interaction between an intrinsic cooperator and defector (i.e., a\ = C and 02 = D). 
Suppose at stage t, player 1 (the C-type) played C and player 2 (the D-type) also played C (because 
he was reciprocating cooperation in the previous round). We label this situation as (C, C), where 
the elements correspond to the first and second players' actions, respectively. Let us label the 
response coefficients of the two individuals as p c and pd in according with intrinsic preferences for 
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C and D, respectively. In that case, the probability that at stage t + 1 they will keep playing (C, C) 
is the probability that the D-type individual keeps reciprocating, i.e., pd- Similarly, the probability 
that the t+1 stage will result in (C, D) is 1 — pd, as this is the probability that the -D-type will revert 
to his intrinsic preference. Finally, the probabilities of playing (D, C) and (D, D) at t + 1 given the 
actions at t is zero, since regardless of whether she plays her intrinsic preference or reciprocates, 
the C-type player will always play C . In a similar way, we can work out the probabilities of playing 
any action pair at t + 1 conditional on the action pair at t, which is summarized with the transition 
matrix M = [mjj], 

Pd (1-Prf) o N 

~Pc)pd (l-p c )(l-pd) pcPd pc{l-pd) 

10 

1 - p c Pc J 

where rriij gives the probability of being in state j at t + 1 when the game was in state i at t, 
and states are ordered as {(C, C), (C, D), (D, C), (D, D)}. We assume that the game is played for 
a very large number of periods with no discounting, so that the proportion of time each pair of 
actions is played is given by the stationary distribution of the Markov chain given by the transition 
matrix M. Taking the dominant left-eigenvector of M and normalizing it, we find the stationary 
state distribution of the game between a C- and D-type individual, ir to be 

7T = ((1 — Pcfpd , (1 - Pc)(l - Pd) , (1 - Pc)Pc(l - Pd)Pd , Pc{l ~ Pd) 2 ) (1 - PcPd)' 2 ■ (2) 



M 



The expected payoff of a C-type player from playing D-type player is the fraction of time spent 
at each outcome (given by the elements of ir) multiplied by the payoff to her from that outcome. 
Suppose that at each period the game has the familiar structure of the prisoner's dilemma (in the 



other-only benefit form 



261]): cooperation yields a benefit of b to the partner, at a net cost of c 



to the cooperator. Defection carries no cost and yields no benefit. Thus, the expected payoff to a 
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C-type player when playing a £>-type, wcd, is given by 

_ (I - Pc)(bp d - c) 

WCD — : • yJ>) 

1 - PcPd 

With a similar calculation, we find the expected payoff to D-type from playing a C-type to be 

{I - p c ){b - cp d ) 

wdc = 1 • (4) 

1 - PcPd 

The expected payoff to a C-type from playing another C-type is trivial; since both players start 
with (C, C) and keep playing these actions, wcc = b — c. Similar reasoning yields wdd = 0. 

In order to determine which type of individual is preferred by natural selection, we calculate 
the condition when the frequency of intrinsic cooperators with responsiveness p c increases in a 
population with intrinsic defectors of responsiveness p d . Suppose that some mechanism (e.g., kin- 
recognition, population viscosity, or greenbeards) causes cooperators to assort themselves so that 
an intrinsic cooperator interacts with another intrinsic cooperator with probability r c ; likewise, 
intrinsic defectors assort among themselves with probability r d . When not assorting specifically 
with their own types, individuals are paired with a partner randomly chosen from the population 
(this is analo gou s to the classic assumption concerning populations with related individuals; see 
equation 1 in [27]). A full description of the dynamic of frequency change in the population would 
require a specification of the mechanism that generates assortment, and how it depends on the 
frequency of genotypes. However, we can leave this unspecified if we only require a qualitative 
picture of frequency change over the course of a single generation. 

Given our assumptions above and that the frequency of the intrinsic cooperative genotype is q, 
the expected fitnesses of an intrinsic cooperator and an intrinsic defector are, respectively, 



(5) 



W c = (r c + (1 - r c )q) w C c + (1 - r c )(l - q) w C d and 
W c = (1 - r d )qw DC + (r d + (1 - r d )(l - q) w DD . 

The condition for increase of intrinsic cooperators is Wc — Wo > 0, which leads to 

b > 1 - (1 - q)Pc ~ qPd + (1 ~ q)r c Pc (1 ~ Pd) + qr d (1 - Pc) Pd 
c qp c + (1 - q)p d + (1 - q)r c (1 - Pd) + qr d (1 - Pc) - PcPd 
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What this equation entails can be seen better by looking at three special cases. First, suppose 
individuals do not respond to each other, p c = Pd = 0, and that the assortment probabilities are 
equal, r c = = r. In this case, condition © becomes - > which of course is Hamilton's rule 
l|. Second, suppose both the intrinsic cooperators and defectors have the same responsiveness, 
Pc = Pd = Pi an d assortment is zero, r c = r d = 0. Then, condition © evaluates to - > -, which is 
equivalent to other rules for the evolution of responsiveness {^j and reciprocity [2^ • Finally, 
giving intrinsic cooperators and defectors the same assortment probability and responsiveness, 
r c = r d = r and p c = p d = p, yields 



b 1 + rp 
- > - 

c r + p 



(7) 



In condition ([7]), the assortment coefficient r and responsiveness p have symmetric effects on the 
increase condition but are also separate because both the product and the sum of p and r appear. 
Hence, assortment of individuals and conditional behaviors cannot be collapsed to a single index, as 
Fletcher & Doebeli 1J] have suggested. Not coincidentally, expression (|7|) is identical in structure to 



the condition for an evolutionary increase in cooperation we have previously derived for continuous 



public goods games 



If 



241 ] . This further reinforces the 



251 ] and to other previously derived results 
point that this basic symmetry and irreducibility is not due to a quirk in this particular model. 

2.2. Reciprocity and synergism 

We can use the same model to investigate the interplay between reciprocity and synergism, the 
latter defined as an extra benefit (or cost, for negative synergism) when two cooperators interact. 



Since Queller's seminal paper in 1985 



5[, synergism, or non-additive interactions in general, appears 



23, 



30]. Fletcher & Zwick 



2|, 



in several efforts to synthesize social evolution theory [e.g. 
for example, consider synergistic interactions as the consequence of reciprocal altruism, and thus 
propose that Queller's 1985 rule should serve as the basis of integration for these historically distinct 
explanations for cooperation. We show in the next section that reciprocity need not imply non- 
additivity in models of continuous traits. In this section, we show that reciprocity and synergism 
in phenotypes (i.e., some non-additive payoff to cooperative behavior) have distinct effects on the 
dynamics of social traits. 



As is customary in models of synergism, we modify the prisoner's dilemma game matrix above 
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by adding a constant d to the payoffs of both individuals when they both cooperate. When d > 0, 
two cooperators produce more benefit together than twice the benefit of a single cooperator; when 
d < 0, they produce less. These cases relate to cooperative acts by the partners being strategic 
complements versus substitutes, respectively, as defined in economics [31]. 

Using the same methodology above, we calculate the expected payoff to an intrinsic cooperator 
when paired with an intrinsic defector and an intrinsic defector against an intrinsic cooperator as 

(1 - p c ) {{bp d - c)(l - p c p d ) + dp d (l - p c )) 

w C D = r, v? and 

(i - pcPdY , g , 

(1 - p c ) ((b - p d c){l - PcPd) + dp d (l - p c )) 

WDC = no i 

(1 - PcPdY 

respectively. These expressions indicate that as with reciprocity and population structure, reci- 
procity and payoff synergism also interact in more complex ways than recognized. To better illus- 
trate the interaction, suppose that both types of players have the same probability of reciprocation, 
Pc = Pd = Pi an d the same probability of assortment, r c = r d = r. Then, as shown in Appendix 
lAl the component that the payoff synergy d adds to the condition for cooperation to increase, 
W C -W D > 0, is 



,| p+r p. ri+ rfi-ya-') ), (9) 



where the first component, p + r(l — p) matches previous work on synergy without reciprocity 



32, 



331 ]. The first thing to note from Q is that, compared the non-synergistic case in condition 



0, 



synergy adds frequency dependence to the increase condition. This is a well-known result 



33 35] that complicates the analysis of synergy. Essentially, this frequency dependence arises from 
explicitly accounting for the frequency of different pairings, which is necessary when payoffs are 
not additive. Another way of thinking about this frequency dependence is that while relatedness 
or genetic assortment generally accounts for pairwise genetic correlations, accounting for synergy 



requires calculating triplet correlations, which are generally much more tedious to calculate 



35l . for 



example]. In models with continuous phenot ype s, assuming that the two competing phenotypes are 



very similar (i.e., weak phenotypic effect; 



27|, 



36, 



371). the effect of non-additive payoffs are neglected 



to first order in the difference between the phenotypes (defined as 5 in section [3]). However, as we 
will see below in section O synergy can still have an effect in terms of which phenotypes are stable 
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in the long term. 

Second, the evolutionary game, defined by the expected payoffs from the repeated game, wcc, 
wcd, WDC-, an d wdd, will take on different game structures depending on the values of d and p. 
Figure Q] illustrates the different possibilities. With p < c/b, a high enough synergism can transform 
the game from a prisoner's dilemma into a coordination game, which will have two evolutionarily 
stable strategies (ESSs): all cooperators or all defectors. If p > 0, then still higher d can turn 
the game into a mutualism game, where intrinsic cooperators have an unconditional advantage. 
When p > c/b, the possibility of generating a coordination game disappears, and increasing d turns 
the prisoner's dilemma into an anti-coordination game (i.e., a Hawk-Dove or Snowdrift game) and 
eventually into a mutualism game as d increases further. It it well known that reciprocity in the 



38|-l40|, for 



form of repeated game strategies such as tit-for-tat can transform the payoff matrix 
example], but to our knowledge, the interaction between reciprocity and synergy demonstrated 
in Figure Q] has not been recognized. This interaction is particularly important in the light of 
recent results that show that genetic non-additivity by itself does not change the conditions for 
the evolution of cooperation [35], unless it changes the structure of the evolutionary game into a 
coordination or anti-coordination game, which have different equilibrium structures and produce 
different patterns of frequency dependence. 

3. Population genetic model 

While the discrete-action model above highlights the interplay between genetic assortment, 
behavioral responsiveness, and phenotypic non-additivity, it is a restrictive model. A more general 
population genetic framework will allow us to generalize the above results to interactions among 
an arbitrary number of individuals who may be from different demographic classes (e.g., males and 
females, queens and workers, juveniles and adults, philopatric and dispersed). A population genetic 
model will also show us how synergistic interactions that result from nonlinear ecological benefits 
and costs can be crucial in the long-term evolutionary stability of phenotypes. The study of such 
synergies as they relate continuously to ecological payoffs is relatively new [e.g., 



25 



28, 



41] and 



presents an opportunity to make predictions about the effects of specific ecological mechanisms on 
the evolution of specific behaviors. At the same time, such generality involves a significant increase 
in the complexity of the model, and hence a different set of simplifying assumptions come into play. 
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In particular, we are interested in determining when a mutant allele that codes for some phe- 
notype with social effects will, on average, increase in frequency in a population and reach fixation. 
The proper quantity to calculate in this case is the fixation probability. For structured popula- 
tions, the fixation probability may be impossible to obtain analytically, so we follow a standard 
approach that approximates the that probability using a first-order a Taylor-series expansion in 
the difference between the mutant and resident phenotypes, S 



27, 



42]. Population genetics theory 



shows that the first-order term in the expansion is sufficient to calculate the standard conditions 
for the evolutionary stability of phenotypes 



27, 



421 ] . Moreover, this first-order term is proportional 



42 



451 ]. We do this using an "individually" 



46, 



47]. 



to a quantity that is much easier to calculate, namely the derivative of the expected change in the 
frequency of the mutant allele with respect to 5 or dAq/ d5. It is this derivative that we calculate 
below. 

First, we will write a general expression for Aq in a class structured model using standard theory 
in population genetics and evolutionary demography 

centered approached that was popularized in population genetics by the Price equation 
Suppose that there are K classes in the population, where an individual fc's class could represent 
the local population it lives in, its age or developmental stage, its sex, or some other phenotypic 
feature. Each class contains 7V& individuals and the total population size is N = Y^k-Nk- In order 
to track allele frequencies in different classes, we let q = (. . . , qjk, ■ ■ •) be a column vector of length 
N where qjj~ is the frequency of the allele in individual k in class j. Average allele frequencies in 
each class in the next time period are calculated by multiplying a transition matrix W = \wijk[ 
times q, where is the probability that an allele in a random individual in class i descended 
from individual k in class j. The transition matrix is a function of the fitnesses of each class and the 
transmission processes that affect how alleles move between classes such as segregation, mutation, 
and migration. We assume that the fitness of individual k in class j is function of its phenotype, pjk- 
In the simplest cases, phenotype depends on genotype in a linear way where pjk = p + qjk8 and 8 is 
a measure of how much the mutant differs from the resident phenotype p. For non-additive genetic 
interactions (such as dominance or epistasis), phenotype depends on genotype in a nonlinear way 



34] 



To calculate Aq, we need to average the class frequencies in such a way that the average 
frequency remains constant in the absence of natural selection. The appropriate weights for this 
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average are the reproductive values of each class [43|, |48|, [49(, a = (. . . , ctk, . . .). Thus, the change 
in the reproductive value weighted allele frequency is 



Aq = a • W • q — a • N • q 



(10) 



where N is a K x N matrix used to create class averages of the allele frequencies q and whose A;-th 
row has the value 1/iVfc in the columns 1 + Yl^i Ni to Ya=i Ni and zero elsewhere for all k. Using 
equation (fTOj) . the derivative of Aq with respect to 5 is given by 



dAq 



a ■ 



dW 

~d5 



K Nj N t „ 

E\ \ ^ OWijk 
i,j,l=l k=l m=l 



(11) 



The components of equation (|11|) are reproductive value, ctj, the effect of one individual's phenotype 
on another individual's probability of transmitting an allele from one generation to the next, d ™ lik 
and the probability that those two individuals both have the mutant allele, qjkQim- When this last 



component, qjkQim^ is averaged over all individuals in a class, we obtain a genetic identity probabi 



that can be related to identity by descent probabilities from classic population genetics 
that emerge as more common quantities such as Fst 



27 



4a 



ity 



511 ] in simple models of population subdivision 



501 ] such as the island model [521 ] . Moreover, these genetic identity probabilities correspond to the 



genetic covariance terms found in regression models in social evolution [53|],which when normalized 



54, 



55] 



by genetic variance correspond to the regression definition of relatedness 

Although equation (jlip allows us to see how evolutionary change should generally rely on genetic 
covariances to first order, understanding the interplay between ecological and genetic mechanisms 
requires unpacking the derivatives of the transition matrix, ^"' Jfc . Evaluating these derivatives can 
be very tricky if the demographic or class structure affects the social interaction since we must in 
effect track the fertility, demographic, and genetic parameters of each class separately. Thus, we 
will assume a relatively simple social interaction that is unaffected by the class structure. Assume 
that each individual in each class participates in a social interaction with n — 1 other individuals 
from the same class (e.g., males interact with males, females with females, adults with adults, etc). 
Each individual choose actions in this social interaction (e.g., an amount of effort in some helping 



behavior); we denote the behavioral equilibrium 



25 



281 ] or time averaged values of the action of 



11 



individual k in class j by a*- k . One crucial quantity of interest is the marginal change in the action 
of one individual in response to another; this is the response coefficient p from section 12. 1( which 
can be written as pj^i = gjp- for the change in the action of individual I in response to a change 

jk 

in the action of k, both of class j 



25 



28j . Since we assume that the social interaction is unaffected 
by class, we can neglect subscripts that specify class in variables involved in the social interaction; 
thus, the response coefficient can be written as pj^i = Pki- 

The actions individuals choose in a social interaction are determined by some proximate be- 
havioral mechanism whose biological components (e.g., neurological structures) are genetically de- 
termined (whereas actions themselves are not inherited). For simplicity, we assume that a single 
"motivational" phenotype p characterizes the proximate mechanism, and it is the evolution of this 
phenotype that we track with equation ()10p . One way of measuring how the motivational pheno- 



type affects actions is which is the effect of individual fc's phenotype on the action of I. The 
response coefficient can be expressed as a function of these derivatives: pki = = |^ / 

The outcome of the social interaction determines the fertility of each individual in the interaction 
(fjk for individual k in class j); for example, an individual's fertility might be determined by the 
net calories it obtains from participating in group hunting and foraging activities. Since the social 
interaction does not depend on class, fertilities are independent of class (fjk = fipjk))- We can 
define the "costs" and "benefits" of the social interaction in terms of fertility as — c = J^r, where "•" 
represents the focal individual, and b = Jj^r for some other individual "o" . Given these assumptions, 
we show in Appendix |B] that dAq/ d5 is 

^— ? oc bp(n - 1) - c + n (n - 1) [b (1 + p(n - 2)) - pc] = S , (12) 
do 

where p is the response coefficient in a resident population monomorphic for the phenotype p and b 
and c are evaluated at the resident phenotypic value p. Equation f)12|) is the condition for increase 
of the phenotype p to first order in the phenotypic deviation, and S > is the n-player analog 



of inequality ([7]). This result was previously derived by Akgay & Van Cleve 



251 ] and is analogous 



to expressions found in McGlothlin et al. [24( and Lehmann & Keller |llfl. It is important to 
note that equation f)12|) is essentially a rearrangement of Hamilton's rule [lj where the effect of 
kin competition is moved from the benefit and cost terms into the scaled relatedness-coefficient 1Z 



12 



421 . l5Cll . l56l. |57J]. We derive an expression for 1Z in Appendix [Bl 

The first thing to note about equation (|12() is that it incorporates our measure of reciprocity, 
the responsiveness coefficient p, without requiring a non-additive genetic model, contrary to pre- 
vious suggestions that non-additive interactions must be the basis of integrating reciprocity with 



genetic assortment 



HQ 



23J. This implies that, to first order in the strength of selection, the 
effect of reciprocity is not frequency dependent even though reciprocity may still generally produce 
frequency-dependent selection. One reason to emphasize this distinction is an implicit assumption 
in the literature that frequency dependent selection implies the need to consider second-order and 
higher terms in Taylor expansions of the change in allele frequency [e.g. Q]. Secondly, equation 
(|12h shows that relatedness r and responsiveness p cannot be both replaced by a single index of "as- 



sortment" 14]. which reinforces the point made in section I2TT1 that relatedness and responsiveness 
are distinct causal factors that interact synergistically. 

If we are interested in what social behaviors or phenotypes evolve in the long-term, we can 
set Aq = in equation (jlip and solve for candidate behaviors or phenotypic values that are 
evolutionarily stable. Setting Aq = is equivalent to setting the expression in (|12p equal to zero 
(i.e., 5 = 0). In order to ensure that these candidate phenotypes are actually stable, we need 
to check whether populations can evolve or converge towards these candidate phenotypes starting 



from neighboring phenotypes 



36, 



59l . |6U ] . This kind of evolutionary stability, called convergence 



stability, is often more important in the long-term evolution of continuous phenotypes than the 



traditional notion of strict evolutionary stability 



6l(]. 



A candidate phenotypic value is convergence stable when the derivative of the dAq/ d5 with 
respect to the resident phenotypic value p is negative [36J. This condition amounts to 



d 
dp 



dAq 
~df 



^ dpa5 + d6 95 + dc95 
dp dp dp db dp dc 



< 



(13) 



where the derivatives in (|13p are total derivatives and measure how p, b, and c change due to 
both direct and indirect changes in the phenotype p. Even in this generic form, condition (|13|) tell 
us two important things about the what influences the long-term evolutionary stability of social 
behaviors. First, evolutionary stability depends on how a change in phenotype affects the level of 
responsiveness, and how responsiveness changes selection for that phenotype, Thus, it is 
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not only the level of reciprocity that matters, but the sensitivity of reciprocity levels to changes in 
the underlying phenotype. 

Second, condition (|13p says that stability depends on how changes in the phenotype result in 
changes in the benefits and costs, and respectively. Recall that b and c are defined in terms 
of derivatives of fertility with respect to actions, so jjp and generate second derivatives of fer- 
tility respect to the actions of individuals. These second derivatives of fertility roughly describe 
the economics of how the behaviors of individuals combine to produce offspring. For example, the 
derivative of a focal individual k's fertility with respect to its own action and the action of another 
individual I, af*da* > measures the economic complementarity or substitutability of the actions of 

k I 

two different individuals. When this derivative is positive, actions are complementary and increas- 
ing effort by one individual increases the fertility benefit of effort by another individual. A negative 
derivative indicates substitutable actions and the reverse effect on a fertility benefit. Complemen- 



tary actions can be thought of as positive ecologica 



effects on the evolution of cooperative behaviors 



synergies, which can have very significant 



28, 



411 ] . Effectively, non-zero values of these 



second derivatives indicate that payoffs as a function of phenotype are non-additive. However, our 
increase condition for the mutant allele did not only accounts for first order effects and thus does 
not include non-additive effects. Thus, in a model with weak effect mutants, the potential ESS 
phenotypes are unaffected by non-additivity, whereas the stability of those phenotypes is affected. 

4. Applications 

4-1. Assortment and reciprocity 

How does the general condition in ([6]) compare to the conditi on that Fletcher &; D oebeli \l\ 



Fletcher Doebeli 



argue that 



derive (their equation 2.2) using a single index for assortment? 
we need to compare the average interaction environment for cooperators and defectors, which will 
depend not only on the interaction probabilities, r c and r^, but also how much each genotype ends up 
playing C and D agains intrinsic cooperators and defectors, respectively. For intrinsic cooperators, 
the frequency of stage games where the opponent plays C, which we denote by e c to match Fletcher 
and Doebeli's notation, is given by: e c = r c + (1 — r c )(q + (1 — q)(ni + ^3)), where the last term 
is simply the sum of the fraction of games between a C- and -D-type where the outcome is (C, C) 
and (D,C) (calculated at the stationary distribution). Similarly, = (1 — r^q^i + ^2). These 
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coefficients now capture Fletcher and Doebeli's notion of the average interaction environment. But 
if we followed Fletcher and Doebeli's procedure, we would arrive at the following condition 

b - > 1 = 1 ~ *** , (14) 

c e c -e d r c (l - p d )(l - q) + r d q(l - p c ) + p d (l - p c ) + q(p c - p d ) 

Comparing this expression with ©, we can see that it is not the correct condition for the increase 
of cooperators. Why does Fletcher and Doebeli's method fail? It fails simply because reciprocity 
and other types of behavioral responses affect a genotype's fitness not only through changing the 
social environment a genotype experiences, but also through changing the phenotype expressed 
by the same genotype in response to this social environment. Fletcher and Doebeli's method of 
partitioning selection into effects due to own phenotype and others' phenotypes remains valid, 
but assortment in their analysis only affects the others' phenotypes part of the partition. In 
contrast, behavioral responses affect the focal individual's own phenotype as well. Therefore, the 
assertion that behavioral responses and non-random interactions are only two special cases of the 
same fundamental quantity, assortment, does not hold. Rather, both factors have distinct (albeit 
symmetric) effects on the selection pressure on social behaviors. 

Incidentally, Fletcher and Doebeli were not the first ones to make this erroneous claim. In 
an influential paper, Queller jfj] states that his derivation of the Hamilton's rule accounting for 
multiplicative effects is valid regardless of whether the the covariance between the focal individual's 
genotype is caused by assortment of genotypes or conditional behaviors. However, as we show above, 
this statement is only true if only one these factors are in place. In the presence of both genetic 
assortment and behavioral responses, the covariance between focal genotype and partner phenotype 
is not enough to account for the selection pressure, since the covariance between the focal genotype 
and the focal phenotype is altered as well. Below, we show that a more recent formulation of social 



selection by Queller 15] suffers from a different but related problem. 

4-2. Separating kin and kith 

We now use the tools derived above to reconsider one of the recent and more prominent attempts 



to disentangle the different causal pathways for shaping the evolution of social traits. Queller |15 ] 



1 Fletcher and Doebeli use the "total benefit" parametrization of the public goods game, as opposed to our "other- 
only" definitions, so we also translate our b and c to match theirs. 



15 



provides an "expanded Hamilton's rule" given by the following inequality 
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, equation 1] 



c + bi ri + y~i di Si + mi fj > ° 



(15) 



where c, as usual, is the cost of a cooperative act to the focal individual, bi is the benefit accruing 
to the i-th relative of the focal individual, rj the relatedness coefficient between the focal individual 
and the i-th relative. The third and fourth terms represent an expansion of the classical Hamilton's 
rule to include multiplicative effects and behavioral feedbacks, which Queller calls "kind" and "kith" 
selection, respectively. Of these terms, kind selection (YlidiSi) involves a coefficient quantifying 
the deviation from additivity di for the i-th interaction partner, which is a "synergistic" fitness 
effect, and a synergy regression coefficient Sj. Finally, the last term fi m i) ^ s composed of 
a coefficient measuring how the i-th social partner's phenotype responds to the focal individual's 
phenotype fi, times the fitness effect of this response m,j. Queller suggests that this expansion of the 
phenotypic version of Hamilton's rule represents more types of fitness effects in a causal manner 
than the classical form, and therefore might lead to better disentangling the various pathways 
through which selection acts on social behaviors. While we agree with the basic premise, we show 
below that equation (I15p lacks a term reflecting the interaction between relatedness and behavioral 
feedbacks and is therefore correct only under restrictive assumptions. 

To see this, we simplify equation ()15f) by dropping the "kind" selection component (i.e. setting 
d = 0) and obtain 



The equivalent increase condition using our population genetic model is S > from equation (|12p 
for pairwise social interactions (n = 2), which yields 



for the case where the behavioral phenotype affects of the fertility of all classes equally (i = 1 
in equation [T6|) . The discrete action model yields an equivalent result in expression ([7]) . In our 
model, p measures behavioral responsiveness, hence it should be analogous to the kith selection 
feedback coefficient, /, of Queller (specifically, p should be a function of /). This allows us to 




(16) 



c(l + pr) + b(r + p) > 



(17) 
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compare equations (fT6|) and (pT7|) . which reveals that equation (fTT|) has an additional term, —rpc, 
not included in equation (|16p : this means that a generalized Hamilton's rule that includes both kin 
and kith selection cannot be constructed by simply adding the kin and kith components together. 

The problem in equation (|15p stems from Queller's assumption, without any explicit justifica- 
tion, that the behavioral feedbacks and indirect fitness effects can be decomposed additively. This 
is apparent in the description of his equation (1), where he asserts that the behavioral feedback 
terms simply arise from decomposing direct fitness effects (i.e., the effect of the focal individual's 
behavior on its own fitness) into non-social and social components. This assumption is correct only 
under very restrictive circumstances, namely when the only individuals that respond to the focal 
individual are those with zero relatedness to the focal individual, whereas related individuals do 
not respond. In other words, there are two classes of social partners that exert selection on the 
same behavioral trait: related conspecific partners that do not exert kith effects but do have kin 
effects and unrelated or heterospecific partners that do exert kith effects. In that case, the increase 
condition becomes (see Appendix O for derivation) 

-c+b(p + r) > 0, 

which is equivalent to the additive combination of kin and kith components in equation (|16p . 
However, in most interactions among social partners with responsive phenotypes, both related and 
unrelated individuals will exhibit behavioral feedbacks. In that case, as (|12p shows, both direct 
and indirect fitness effects will be functions of behavioral responses (p), and Queller's additive 
separation of kin and kith effects becomes incorrect. 

5. Discussion 

One of the major goals of evolutionary and behavioral ecology is to elucidate the casual biolog- 
ical pathways that drive the evolution of social traits. However, natural selection rarely operates 
through a single pathway, and therefore the various causal components of social evolution need to 
be integrated and their commonalities and interactions explored. Our main goal in this paper is 
to contribute to this synthesis effort. In so doing, we highlight that genetic assortment, behavioral 
responses, and non-additive interactions between phenotypes all have both distinct and interact- 
ing effects. Therefore, combining these components requires an explicit and consistent modeling 
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approach. While some of our results (like those relating relatedness and reciprocity) are found in 



previous literature 



11 



24 



25, 



621 ]. their general implications are not widely appreciated, or in some 
cases, there are misleading claims about these implications. The results that we emphasize in this 
paper are as follows. 

(i) Phenotypic assortment is not a univariate quantity: it matters how much assortment comes 
from both behavioral responses and genetic assortment. Further, behavioral responses and genetic 
assortment cannot simply added to each other, even when interactions are additive. Rather, they 
interact with each other both jidditively and multiplicatively. Prior results analogous to this can be 



found in Lehmann & Keller 



111, McGlothlin et al. Q 



McGlothlin et al. [24| and Akgay & Van Cleve [25|]. Yet, despite 



this interaction being known for some time, its crucial role seems not widely appreciated. Akgay & 



Van Cleve [25| is the first paper to our knowledge to highlight the synergistic and symmetric effect 
of genetic assortment and behavioral responses on the response to selection of social phenotypes. In 
that work, we demonstrated how synergism between genetic assortment and behavioral responses 
can facilitate the transfer of fitness effects from the within-group level to the between-group level, 



631 ] . These previous results assumed 



a crucial step in evolutionary transitions in individuality 
continuous phenotypes; here we show that a similar synergism is found in interactions with discrete 
phenotypes. 

(ii) Behavioral responses and reciprocity do not require non-additive genetic or phenotypic 
interactions, despite assertions to the contrary suggestions that measuring reciprocity requires 
non-additive terms due to frequency dependent selection [j], 15, 13]. This is a specific example 
of a more general confusion regarding the connection between frequency dependent selection and 
weak selection approximations of evolutionary dynamics in structured populations that generate 
expressions such as (I12p that are frequency independent. In general, behavioral responses and 
reciprocity will generate both additive and non-additive interactions where the additive interactions 
can be thought of as the average effect of reciprocity and non-additive interactions contribute higher 
order statistical moments. Including only additive interactions and capturing only the average effect 
of reciprocity still generates expression (|12p . which captures the intuitive and crucial notion that 
reciprocity correlates phenotypes and thus can lead to increasing levels of cooperation. 

(iii) Although measuring behavioral responses and reciprocity does not require including non- 
additive interactions, non-additive interactions can still be important, even in models with weak 
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selection and continuously varying phenotypes. Specifically, non-additive payoffs or fitness will 
affect the evolutionary (convergence) stability of a phenotype and how the stable phenotype changes 
with underlying parameters. The former effect can be seen in equation (|13p . which shows how non- 
additivity in the benefits and costs of a social phenotype (behavior) appear in the convergence 
stability condition. Non-additivity in benefits and costs relate directly to ecological mechanisms 
that convert the effort of individuals in the social interaction into payoffs in terms of fertility or 
survival. For example, individuals who put effort into a group hunt (which might be modeled 



by a stag- hunt game [39j) may receive more return in terms of the amount of prey item per 
unit of caloric investment when more individuals join the hunt. This kind of ecological synergy, or 

of investment in cooperative 



"complementarity" in economics 
interactions both within groups 



311 ] , can have large effects on the leve 



25 



411 ] . It is important to note 



28| and in structured populations 
that synergies in discrete models can also model complementarity; for example, the positive values 
of the parameter d in expression ((9|) can also represent complementary actions (and negative values 
represent the opposite, called "substitutable" actions), though non-additive interactions generate 
frequency dependence in such models. 

5.1. Approximate and exact models of evolutionary change 

Fundamentally, there are two main mathematical approaches to modeling the effect of selection 
on continuous traits. The first approach, and the one that we employ, uses a first-order Taylor series 
to approximate the effect of selection when it is weak and the mutant and resident phenotypes are 
similar (i.e., small 8). This approach is justified by the utility of the first-order term in calculating 



27|, 



the evolutionary stability of specific phenotypes 
will not describe the evolutionary dynamics very well when se 
higher effects matter. In contrast, the second approach 38 



421 ] . However, the first-order approximation 
ection is strong and second order and 



48, 



53, 



64 



661 ] fits a linear regression 



to the full change in allele frequency or mean phenotype. The result is that the linear regression 
represents the exact partial change in allele frequency or mean phenotype due to natural selection 



53, 



67 



68j. The generality of such an approach comes, however, at the cost of a "clear separation 



between genetic and ecological parameters" [p. 24 in ref [35j. For example, the partial change 
approach makes it impossible to tell when ecological mechanisms generate frequency dependent 
selection that might result in the maintenance of phenotypic variation. Also, mechanistic non- 
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additive interactions, such as synergistic payoffs 34] and cpistasis 69], become buried within linear 
regression coefficient, which makes it difficult to tell when such interactions are important. We feel 
that the regression approach is a useful tool for qualitatively describing evolutionary change, while 
the Taylor series approach is better suited for studying the mechanisms underlying the evolution 
of social behaviors. 

5.2. Empirical issues 

The empirical study of the interplay between behavioral responses and genetic assortment and 
the role of synergistic payoffs is still relatively new. Most of the work on the linkage between 
behavioral responses and genetic assortment comes from the now rapidly growing application of 



indirect genetic effect ( 



7Mm. IGE models 



24 



GE) models to laboratory, agricultural, and field-based populations [e.g. 



80 



811 ] extend classical quantitative genetics 



821 ] to include a coefficient, 



ip, that measures the effect of a social partner's phenotype on the phenotype of a focal individual. 
Importantly, the IGE coefficient -0 directly maps to our behavioral responsiveness coefficient p 2f| 
and is thus also a measure of reciprocity. We know from equation (|12p (and analogously equation 
18 in McGlothlin et al. \24 ) that an interaction between behavioral responses and relatedness is 
important for selection on social traits, even when no such interaction is assumed to affect individual 
phenotypes (see equation IC.5I in the Appendix or equation 5 in ref [24|). However, most of the 



empirical work to date spends little time on the interaction of 



H 



behavioral responses and relatedness 
77 ] , and only Frere et al. [3] emphasize 



with the exception of Bleakley &; Brodie [75[ and Frere et al. 
the crucial role of this interaction. The empirical work assumes individual phenotypes to have an 
explicit interaction term, and it is not yet clear how empirical estimates of this interaction can be 
tied to predicted interaction term that arises in (|12p that measure the effect of selection on the trait. 
Thus, more work needs to be done understanding the interaction between behavioral responses and 
relatedness both theoretically and empirically. 

Empirical studies explicitly focused on non-additive payoffs seem scarce, except in the context 



of green-beard genes 



83] , e.g.]. We do not believe that this state of affairs reflects the importance 



of non-additive interactions between phenotypes in nature. Instead, non-additive and non-linear 
interactions are likely to be abundant and their effects strong. For example in parental care, 
many arguments implicitly or explicitly rely on the contributions each parent combining non- 
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additively [28|,|84j]. We suggest that measuring non-additive interactions at both the payoff (fertility 
or survival) and fitness levels will prove important for explaining the evolution of social behaviors, 
regardless of whether the phenotypes in question vary more or less continuously or are discrete 
characters with large effects. 

To conclude, a careful analysis of behavioral responses, genetic assortment, and non-additive 
interactions shows a complex interaction between these three pathways. We believe that focusing 
on this complexity will ultimately deliver a more complete and nuanced understanding of the 
evolutionary forces shaping social behaviors. The necessary mathematical frameworks for such 
understanding exists, though as we show in this paper, it needs to be applied with some care. Once 
crafted, these theories can be tested against the growing abundance of sophisticated datasets that 
measure both networks of social interactions as well as important demographic and fitness-related 



variables [e.g. 



85, 



86] 
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Appendix A: Two-player game with synergy 

We follow the discrete-action model denned in section 12.11 Individuals are either intrinsic 
cooperators or intrinsic defectors who can either play their intrinsic strategy or reciprocate their 
partner's last action. Let p c and pd be the probabilities that an intrinsic cooperator and defector, 
respectively, reciprocate or copy their partner's last action. We allow for non-randomly interaction 
among intrinsic cooperators and intrinsic defectors that might results from population structure or 
other assortation mechanisms and define r c and to be the probabilities that intrinsic cooperators 
and defectors, respectively, interact among themselves non-randomly. Two cooperators who play 
each other each earn payoff b — c + d, where b is the benefit, c is the cost of cooperation, and d is 
a synergy term that account for non-additive payoffs. A cooperator and defector playing together 
results in a payoff of — c for the cooperator and b for the defector. Two defectors playing each other 
each earn zero payoff. Given this setup and following the analysis of section 12. 1| the payoffs to an 
intrinsic cooperator paired with an intrinsic defector and vice-versa are 



w D c 



(1 - p c ) {{bp d - c)(l - p c pd) + dp d {l - p c )) 

(1 - PcPd) 2 

(1 - pc) {{b - Pdc)(l - PcPd) + dp d {\ - pc)) 



(1-Pc 



respectively, which are also given in equation ([8]) in the main text. 

We can plug the above values for wcd and wdc, along with wcc = b — c + d and wdd = 0, 
into the expression for fitness in equation ([5]) to derive the condition for cooperation to increase, 
Wc — Wd > 0, which simplifies to 

h ,' ppc + (1 - p)Pd + r c (l -p)(l- p d ) + r d p(l - p c ) - p c pd 

1 - PcPd 

1 - (1 - p)p c - ppd + (1 - p)r c p c (l - p d ) + r d pp d (l - p c 

T^cjd ; ^ 

f(p + r c (l - p))(l - p d ) (1 - p 2 c Pd) + (1 - P + pr d ){l ~ Pc?Pd\ n 
+d { W^d? J >0 - 

This expression simplifies to condition ([6]) when d = 0. Another simplification is to assume that 
the reciprocation and assortment probabilities are the same for cooperators and defectors, namely 
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Pc = Pd = P an d r c = r^ = r. In this case, expression (|A.1|) simplifies to 

^_ c ^ +d ( (p+K1 _ ri) + ^(l_^) |> „, (A ,, 

which in turn simplifies to condition ((7]) when d = 0. 
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Appendix B: Allele frequency change in a class structured model 

In section [3l we build a general class-structured model of allele frequency change and present a 
first-order approximation, expression (|12|) . that is valid given the phenotype of one individual can 
affect the fertility of another individual so long as both individuals are in the same class and the 
effect of phenotype on fertility is independent of class. Our goal in this section is to show how to 
arrive at expression (|12|) beginning with 



dAq dW ^ ^ J^v dwin 

-df = a -^f cl= ^ LL^w- ( R1 ) 

i,j,l=l k=l m=l 

Recall that the Taylor series is expanded around 5 = 0, so the partial derivative g^ jfc in (jB.ip 
are evaluated at neutrality where natural selection on fertility differences is absent. Additionally, 
5 = implies that all individuals in a class are equivalent, which means that the partial derivatives 
above fall into three cases: the first case is the effect of a focal individual's phenotype on its own 
probability of sending an allele to class j in the next generation from its own class i, j^r-', the 
second is the effect on that probability of another individual's phenotype from the same class as 
the focal, 75^-; and the third is the effect on that probability of the phenotype of an individual in a 

Op jo 

different class than the focal individual, where I 7^ j . Substituting these derivates into (|B.1|) , 
we obtain 



dW x— -v I dwi-j—^r , , v — * uii , 1 1 




Further, we assume that the phenotype only affects fertility (though it could affect survival or 
demographic factors such as carrying capacity or dispersal rates), which means the derivaties in the 
left-hand side of the equations in (jB.3[) can be split into two components, the effect of the phenotype 
on fertility and the effect of fertility on the probability an allele in class i descends from class j. 
The latter derivatives we represent with r where t^j. = ^ iL , Hj,jo = wjr^i an d T^y = ^jrf for 
I ^ j. These derivatives essentially contain transmission probabilities due to dispersal, mutation, 
allelic segregation, age-dependent mortality, etc, and we assume they are independent of phenotype. 
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Given these two classes of derivatives, 



dw. 



dp, 



In, 



dwi 



dp j a ~ n hJ»dp 



dfij 



L r % 3>3°~5p~~ 



+ T ij,j 

dpi ~ T V> 1 {dpi. ^ L hpio 



dfij_ 
)'• 

dfu 



it+(^-2)it 



for (l,m) = (j,k) 
for / = j and k 
for I / j . 



(B.3) 



Finally, we assume that the effect of phenotype on fertility is independent of the source or 
destination classes of the offspring; that is, we can define —C 
Combining C and B with (|B.3P into equation (|B.2p results in 



dfij 

dp 3 » 



dfij 
dp jo 



df 

dp a ' 



a 



dW 
~d5~ 



C 



B 



K 

Y oiiNj(Nj - 1) [Tijj,q], + {T ijd . + (Nj - 2)Tyj ) + Nj^NiiNi - l)T ijt iqJqi 



--K c (-C + 715) , 



(B.4) 



where Kc and -?Tb equal the first and second bracket in (|B.4p . respectively. Scaled relatedness is 
given 1Z = Kb/Kc- The terms in parentheses on the last line of (|B.4p are the same as the right- 
hand side of equation (|12|) . once C and B are evaluated following the assumptions in that section 
that the phenotype p is a parameter of some physiological/psychological mechanism that affects the 
underlying actions individuals choose, a*, and the marginal effect of the action of one individual 
on the action of another is p. A detailed exposition of this analysis is given by Akgay <fe Van Cleve 
25 1 where — C corresponds to j^- in equation (2) and B to in equation (3) in that paper. In 
most biological cases of interest, Kc > 0, which results in the proportionality in equation (|12D . 
The convergence stability condition in (|13p can be derived from equation (]12p simply by recalling 
the assumption that the phenotype affects fertility but not demographic quantities contained in 1Z. 
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Appendix C: Uni-directional and bi-directional reciprocity in indi- 
rect genetic effects models 

In this section, we will show that: (i) assuming a uni-directional type of reciprocity where the 
focal individual can affect its partner's phenotype, but not vice-versa, can lead to the expanded 



Hamilton's rule presented by Queller 



la . equation 1]; and (ii) bi-directional 



equation (jXTJ) , which is equivalent to results derived by us [25J and others 



11 



reciprocity leads to 



2J] . Notation in the 



following equations is taken from Queller [15j and generally follows quantitative genetic approaches 



to social evolution [e.g . 



(IGE) models 



24 



80, 



15, 



fill 



38 



53] with a particular focus on the methods in indirect genetic effect 



We begin with the pi idiotypic version of Hamilton's rule, derived IVorn I he Price equation 



42J 



that is equation (7) in Queller [15|, which is 



Pwp-P' + Pwp'-p 



Cov [G, P'} 
Cov[G,P] 



(C.l) 



where we use primes to denote variables associated with the social partner. Suppose that the 
phenotype of the social partner is defined as 



P' = G' + /3p> P P + e' 



(C.2) 



which says that the phenotype of the social partner is a linear function of its average breeding value 
(G'), the effect of the phenotype of the focal (/3p>pP), and a random component with mean zero 



(e', a standard assumption in quantitative genetics; see ref 
of focal individual is given by 

P = G + e, 



82j). Suppose also that the phenotype 

(C.3) 



which says the phenotype of the focal individual is a linear function only of its breeding value and a 
random component; thus, there is no effect of the partner's phenotype on the focal, which embodies 



the assumption we believe underlies Queller [15('s analysis. Plugging P and P' given in equations 
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(|Cl2"j) and (j03|) into equation (fGTj) yields 



a +R Gov [G, G' + /3p, P P + e'} _ 

PWP-P' + PWP'P m ^ i — i ~~ PWP-P' + PWP'-P KPP'P + PG'G) 

Cov|G, G + ej 



(C.4) 



which can be written as 



-c + b(p + r) 



in the notation of the main text and is equivalent to the additive combination of kin and kith 
components in equation (1) of Queller 

To include bi-directional reciprocity, we follow the analysis presented by McGlothlin et al. 
Akcay & Van Cleve 25(] obtain the same results and present a comparison of the relative merits of 



the current approach and the IGE approach. We start by setting 

P = G + p Ppl P' + e , 



(C.5) 



which expresses the fact that P' also has an effect on P. Before we can plug this definition of P or 
the definition of P' into equation (7) of Queller [15j, we must insert our definition of P' into (fCl)j) 
and our definition of P into (|0.2|> and solve the resulting equations; the solutions are 



P 
P' 



G + e + /3 PP/ (G' + e') 
1 — P p pi (3 pip 

G' + e' + f3 P ip(G + e) 
1 — Pp pi (5 pip 



(C.6) 



These solutions can be compared to the analogous solution in McGlothlin et al. 
Plugging equations (1C,6|) into equation (7) of Queller [la] yields 



24| (equation 6). 



Pwp-p' + Pwp'p- 



Cov 


G, 


G'+e'+l3p,p(G+e)~ 


l—/3p p ippi P 


Cov 


G, 


G+e+Ppp,{G'+e')' 


1—Ppp'Ppip 



Pwp-p' + PwP'P 

Pwp-p' + Pwp'-p 
Pwp-p' + Pwp'-p 



Cov [G,G' + e' + f3pi P (G + e)} 
Cov[G,G + e + (3 P p> (G f + e')} 

Cov [G, G'} + fipi P Var[G] 
Var[G] +p PP i Cov[G,G'] 
Pg'g + Pp'p 

1 + PppiPc'G 
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Thus, the Hamilton's rule one gets from equation (7) of Queller [15[ should be 



Q 



Pwp p' + Pwp'-p f G % + P > 
J- + ppp'Pg'g 



or 



(1 + Ppp'Pg'g) fiwP-P' + Pwp'-p (Pg'g + Pp'p) > 



(C.7) 



Assuming that (3 pip = (3pp/, equation (|C.7|) becomes in our notation 



-c(l + pr) + b(r + p) > 0, 



which is exactly equation (|17l) from our model. This is also exactly equivalent to equation (18) 



given by McGlothlin et al. 



241 ] for pairwise interactions once the appropriate mapping between p 



and their IGE coefficient ip is made [see Appendix A9 in ref 



25J. 
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Figure 1: The different possibilities for how a prisoner's dilemma game with reciprocity and non- 
additive payoffs can get transformed, depending on the probabilities of reciprocation p and the 
non-additive payoff or synergy component d. 
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